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Abstract —In this paper, we generalize the fundamental 
relation between the derivative of the mutual information 
and the minimum mean squared error (MMSE) to mul¬ 
tiuser setups. We prove that the derivative of the mutual 
information with respect to the signal to noise ratio (SNR) 
is equal to the MMSE plus a covariance induced due 
to the interference, quantified by a term with respect 
to the cross correlation of the multiuser input estimates, 
the channels and the precoding matrices. We also derive 
new relations for the gradient of the conditional and non¬ 
conditional mutual information with respect to the MMSE. 
Capitalizing on the new fundamental relations, we derive 
closed form expressions of the mutual information for 
the multiuser channels, particularly the two user multiple 
access Gaussian channel driven by binary phase shift 
keying (BPSK) to illustrate and shed light on methods to 
derive similar expressions for higher level constellations. 
We capitalize on the new unveiled relation to derive the 
multiuser MMSE and mutual information in the low-SNR 
regime. 

Index Terms —Estimation Theory; Gradient of condi¬ 
tional mutual information; Gradient of non-conditional 
mutual information; Gradient of joint mutual informa¬ 
tion; Information Theory; Interference; MAC; MMSE; 
Mutiuser I-MMSE; Mutual Information. 

I. Introduction 

Connections between information theory and es¬ 
timation theory dates back to the work of Duncan, 
in 111 who showed that for the continuous-time 
additive white Gaussian noise (AWGN) channel, the 
filtering minimum mean squared error (causal esti¬ 
mation) is twice the input output mutual information 
for any underlying signal distribution. Recently, 
Guo, Shamai, and Verdu have illuminated intimate 
connections between information theory and estima¬ 
tion theory in a seminal paper, ||2]]. In particular, Guo 
et al. have shown that in the classical problem of 
information transmission through the conventional 
AWGN channel, the derivative of the mutual in¬ 
formation with respect to the SNR is equal to the 
smoothing minimum mean squared error (noncausal 
estimation); a relationship that holds for scalar, 
point-to-point vector, discrete-time and continuous¬ 
time channels regardless of the input statistics. The 


relevance of these recent connections comes from 
the fact that mutual information and MMSE are 
two canonical operational measures in information 
theory and estimation theory: mutual information 
measures the reliable information transmission rate 
between the input and the output of a system for a 
specific signaling scheme, while MMSE measures 
the minimum mean squared error in estimating the 
input given the output. Later Palomar and Verdu 
generalized this relation to linear vector Gaussian 
channels 11, 0. The mutual information was also 
represented as an integral of a certain measure of 
the estimation error in Poisson channels HI, a. 
There have been extensions of these results to the 
case of mismatched input distributions in the scalar 
Gaussian channel in 0 and 0. Most recently, 
Ghanem in 0, IITOll . derived the gradient of the mu¬ 
tual information with respect to arbitrary parameters 
for the multiple access Gaussian channels, a relation 
that extends the relation for the case of mutually 
interfering inputs in linear vector Gaussian channels 
to the case of multiple non-mutually interfering 
inputs and with mutual interference, a starting point 
to the results in this work. The implications of a 
framework involving key quantities in information 
theory and estimation theory are countless both from 
the theoretical lITTII . IIT^ and the more practical 
perspective, Cni, lUll, O, Ull, lEll. 

The intimate connection between information 
measures and estimation measures allow few ex¬ 
plicit closed form expressions of the mutual in¬ 
formation for binary inputs to be derived, partic- 
ularliy ones for BPSK and QPSK over the Sin¬ 
gle Input Signle Output (SISO) channel, 0, lITSl . 
IIT9l . Therefore, it is of particular importance to 
address connections between information theory and 
estimation theory for the multiuser case in order 
to understand the communication framework under 
such inputs and try to provide explicit forms when 
multiple accessing and interfering inputs coexist. 

In this paper, we first revisit the connections 
between the mutual information and the MMSE for 
the multiuser setup, see also lITOll . ifTTl . Therefore, 
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the fundamental relation between the derivative of 
the mutual information and the MMSE, known as 
I-MMSE identity, and defined for point to point 
channels with any noise or input distributions in 
II 2 II is not anymore suitable for the multiuser case. 
Therefore, we generalize the I-MMSE relation to 
the multiuser case. Moreover, we generalize the 
relations for linear vector Gaussian channels in 
lf3ll to multiuser channels where we extend these 
relations to the per-user gradient of the mutual in¬ 
formation with respect to the MMSE, channels and 
precoders (power allocation) matrices of the user 
and the interferers. Then, we derive new closed form 
expressions for the mutual information for single 
user and mutiuser scalar Gaussian channels driven 
by BPSK inputs. Eurther, we analyze the MAC 
Gaussian channel model at the asymptotic regime 
of low SNR and capitalize on the new unveiled 
connections between the mutual information and the 
MMSE to derive the low SNR expansion of the 
mutual information in a multiuser setup. 

The implications of the derived relations in its 
two-user’s version presented in this paper, or in 
its A'-users version If20]l , where the effect of K — 
1 interferers is also characterized, are many-fold, 
whether to characterize the capacity of interference 
channels, or to characterize novel schemes that 
extends state of art power allocation lfT4ll to ones 
that can capitalize on the characterization of the 
interference effect 0|. Additionally, the extension 
of such relation to wireless networks with noisy 
coded flows is of particular importance iflUl . Il22ll . 
Eor instance, a novel piggybacking capacity achiev¬ 
ing scheme is recently proposed for networks with 
Amplify and Eorward (AE), ll^ capitalizing on this 
result and its characterization of the gap from the 
cut-set upper bound. 

Throughout the paper, the following notation is 
employed, boldface uppercase letters denote ma¬ 
trices, lowercase letters denote scalars. The super¬ 
script, (.)“^, (.)^, (.)*, and (.)^ denote the inverse, 
transpose, conjugate, and conjugate transpose op¬ 
erations. The (V) denotes the gradient of a scalar 
function with respect to a variable. The E[.] denotes 
the expectation operator. The ||.|| and Tr {.} denote 
the Euclidean norm, and the trace of a matrix, 
respectively. 

The rest of the paper is organized as follows; 
section II introduces the system model. Section III 
introduces the new fundamental relations between 


the mutual information and the MMSE. Section IV 
introduces the new closed form expression of the 
mutual inforamtion. Section V introduces analysis 
at the asymptotic regime of low SNR. Section VI 
concludes the paper. 

II. System Model 

Consider the deterministic complex-valued vector 
channel, 

y = snr HiPiXi -f snr H2P2X2 -f n, (1) 

where the x 1 dimensional vector y and 
the rij X 1 dimensional vectors xi, X2 represent, 
respectively, the received vector and the indepen¬ 
dent zero-mean unit-variance transmitted informa¬ 
tion vectors from each user input to the MAC 
channel. The distributions of both inputs are not 
fixed, not necessarily Gaussian nor identical. The 
UrXnt complex-valued matrices Hi, H2 correspond 
to the deterministic channel gains for both input 
channels (known to both encoder and decoder) and 
n CM(0,1) is the Ur X 1 dimensional complex 
Gaussian noise with independent zero-mean unit- 
variance components. 

III. New Eundamental Relations between 

THE Mutual Information and the MMSE 

The first contribution is given in the following 
theorem, which provides a generlization of the T 
MMSE identity to the multiuser case and traverses 
back to the same identilty of the single user case. 

Theorem 1: The relation between the derivative 
of the joint mutual information with respect to the 
snr and the total non-linear MMSE for a multiuser 
Gaussian channel satisfies: 

dl(snr) / N N 

— - -= mmseisnr) -f ipisnr) ( 2 ) 

dsnr 

Where, 

mmse{snr) = Tr {HiPiEi(HiPi)^} + 

Tr {H2P2E2(H2P2)^}, ( 3 ) 

= _Tr{HiPiEy[E„^|y[xi|y]E„^|y[x2|y]t](H2P2)t} 

- Tr {H 2 P 2 Ey [E„^|y [X 2 |y]E„^|y [xi|y]+](HiPi)+ } , 

Proof 1: See Appendix A. 
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The per-user covariance matrix of the estimation 
error, also called the per-user MMSE matrix is given 
respectively as follows: 

El =Ey[(xi -Xi)(xi-Xi)^] (4) 

E 2 = Ey[(x 2 - X 2 )(X 2 - X2)1- (5) 

The input estimates of each user input is given 
respectively as follows: 


Xi =E,,^|y[xi|y] 


Xi,X 2 


^lPy\xi,x2 (y|xi,X2)pa:i(xi)pa:2(x2) 
Pviv) 


( 6 ) 


X2 =Ex2|y[x2|y] 


= E 

Xi,X 2 


^2Py\xi,X2 (y|xi,x2)p xi (xi)p 

X2 (X2) 

Pviy) 


(7) 


The conditional prohahility distribution of the Gaus¬ 
sian noise is defined as: 


Py\x 


dy|xi,x2) = 


IlPlxi —VinfH 2 P 2 X 2 


( 8 ) 


The probability density function for the received 
vector y is defined as: 


Pviy) = Py\xi,X2iy\^1^^2)Pxii^l)Px2i^2)- (9) 

Xi,X 2 


Henceforth, for the case of two-user MAC, the 
system MMSE with respect to the SNR mmse{snr) 
is the MMSE corresponding to the best estimation 
of inputs xi and X 2 upon the observation for a given 
signal-to-noise ratio (SNR), i.e.. 


mmse{snr) = Ey ||HiPi(xi — Exi|y[xi|y])| 


■ Ev 


|H2P2(X2-E,,,|y[x2|y])|r 


( 10 ) 


= Tr {HiPiEi(HiPi)t} -f Tr {h2P2E2(H2P2)^} 
as given in Theorem [B 

Note that I{snr) is the joint mutual informa¬ 
tion in /(xi,X 2 ,y), the term mmse{snr) is due 
to the users MMSEs, particularly, mmse{snr) = 
mmsei{snr) + rnrnse 2 {snr) and iplsrir) are covari¬ 
ance terms that appear due to the covariance of the 
interferers. Those terms are with respect to the chan¬ 
nels, precoders, and non-linear estimates of the user 
inputs. When the covariance terms vanish to zero, 
the mutual information will be equal to the mmse, 
with respect to the SNR. This applies to the single 
user and point to point communications. Therefore, 


the result of Theorem [B is a generalization of pre¬ 
vious result and boils down to the result of Guo et 
al, llBl under certain conditions which are: (i) when 
the cross correlation between the inputs estimates 
equals zero (ii) when interference can be neglected 
or easily removed (i.e. interference is very weak, 
very strong, or aligned) (iii) under the signle user 
setup, (iv) when certain access and power allocation 
scheme is used for inputs Gaussian distributed and 
a successive clean interference estimation process is 
performed, (23]] • 

Such generalized fundamental relation between 
the change in the mutliuser mutual information and 
the SNR is of particular relevance. Eirstly, such 
result allows us to understand the behavior of per¬ 
user rates with respect to the interference due to 
the mutual interference and the interference due to 
other users behaviour in terms of power levels and 
channel strengths. In addition, the result allows us 
to be able to quantify the losses incured due to 
the interference in terms of bits. Therefore, when 
the term 'tjj{snr) equals zero. The derivative of the 
mutual information with respect to the SNR equals 
the total mmse(snr): 

dl(snr) 

— - -= mmseisnr), (12) 

dsnr 

which matches the result by Guo et. al in (211. 


A. The Conditional and Non-Conditional I-MMSE 
and a Remark on Interference Channels 

The implication of the derived relation on the 
interference channel is of particular relevance. To 
particular, it is worth to note that we can capi¬ 
talize on the new fundamental relation to extend 
the derivative with respect to the SNR to the 
conditional and non-conditional mutual information 
components that provides per user rates. To make 
this more clear, we capitalize on the chain rule of 
the mutual information which states the following: 

/(xi,X 2 ;y) = /(xi;y) -f J(x 2 ;y|xi) (13) 

Therefore, through this observation we can conclude 
the following theorem. 

Theorem 2: The relation between the derivative 
of the conditional and the non-conditional mutual 
information and their corresponding minimum mean 
squared error satisfies, respectively: 

ci/(x2;y|xi) 

- - -= mmse 2 {snr) ip{snr) (14) 

dsnr 
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—-—^— = mmseii'ysnr) (15) 

dsnr 

Proof 2: Taking the derivative of both sides 
of (fTSI) . and subtraeting the derivative of J(xi;y) 
whieh is equal to mmsei{'ysnr), 7 is a scaling 
factor, due to the fact that Xi is decoded first consid¬ 
ering the other users’ input X 2 as noise. Therefore, 
Theorem [2] has been proved. 

Note that the derivative of the conditional mutual 
information as well as the non-conditional mutual 
information can be scaled due to different SNRs in a 
two-user interference channel. However, the scaling 
is straightforward to apply. For further details, refer 

to m, m. 

The following theorems in addition to Theorem 
[U generalizes the connections between information 
theory and estimation theory to the multiuser case. 

Theorem 3: The relation between the gradient of 
the mutual information with respect to the channel 
and the non-linear MMSE for a two user-MAC 
channel with arbitrary inputs ([B satishes: 

VH,/(xi,x2;y) = HiPiEiPi^ - H2P2E[x2xl]Pil 

(16) 

VH./(xi,X2;y) = H2P2E2P^ - HiPiE[xixl]P2l 

(17) 

Proof 3: The detailed proof has been provided in 
Appendix B, ifTOl 

Theorem 4: The relation between the gradient of 
the mutual information with respect to the precoding 
matrix and the non-linear MMSE for a two user- 
MAC channel with arbitrary inputs ([B satisfies: 

Vp,/(xi,x2;y) = HlHiPiEi - h1H2P2E[x2x 1] (18) 
Vp,/(xi,X2;y) = HIH 2 P 2 E 2 - h1HiPiE[xix1] (19) 

Proof 4: The detailed proof has been provided in 
Appendix C, ifTOll 

Theorems [3] and |4] provide intuitions about the 
change of the mutual information with respect to 
the changes in the channel or the precoding (power 
allocation). A straightforward connection between 
both changes when each gradient is scaled with 
respect to the changing arbitrary parameter, we can 
write this connection as follows: 

Vp,/(xi,X 2 ;y)Pl = HiVHi/(xi,X 2 ;y) ( 20 ) 

Vp 2 /(xi,X 2 ;y)P^ = H^VH 2 /(xi,X 2 ;y) ( 21 ) 

Note that we can derive the gradient of the mutual 
information with respect to any arbitrary parameter 
following similar steps of the proof of the previous 
two theorems. Note also that the derived relations 


in (fT^ . (fT71) . (fTSl) . and (fT9l) reduce to the relation 
between the gradient of the mutual information 
and the non-linear MMSE derived for the linear 
vector Gaussian channels ||3]| if the cross correlation 
between the input estimates is zero, which applies 
to linear estimation with perfect reconstruction of 
estimates and removal of one estimate from another, 

m. 


B. Gradient of the Conditional and Non- 
Conditional Mutual Information 

Theorem |4] shows how much rate is lost due to 
the other user. The gradient of the mutual informa¬ 
tion provides a set of terms that are associated to 
the mutual interference, which provides a positive 
change, however, the loss is attributed to the effect 
of the non-mutual interference, which appears in the 
second term as a negative change. Therefore, we 
can account for such quantified rate loss via optimal 
power allocation and optimal precoding. In order to 
be able to understand more deeply the achieved rates 
of each user in a MAC channel. Ee capitalize on the 
chain rule of the mutual information to derive the 
conditional mutual information as follows, 

/(xi,X 2 ;y) = /(x 2 ;y) -f /(xi;y|x 2 ) ( 22 ) 


and. 


/(xi,X 2 ;y) = /(xi;y) -f /(x 2 ;y|xi) (23) 


Where the joint mutual information term is defined 
as follows: 


/(xi,X 2 ;y) = E 


_ Pv\x-i_,X'2 (y|xi,x2)_ 

Exi.xi Pylxi.x^ (y|x'i, x'ajpxi (xi)Pxi (x'2) 

(24) 


Where, x) and x'^ correspond to all possible per¬ 
mutations of xi and X 2 drawn from each inputs’ 
constellation set. The non-conditional mutual infor¬ 
mation is defined as follows: 


/(xi;y) = E 


Py|xi(y|xi) 

Ex;l^y|xi(y|x'i)Pxi(x'i) 


(25) 


Where the signal X 2 is considered as noise. 


/(x 2 ;y) = E 


Py|x 2 (y|x 2 ) 


(26) 


Where the signal xi is considered as noise. The con¬ 
ditional mutual information is defined as follows: 
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/(xi;y|x 2 ) = E 


Pv\x[,x2iy\A,^2) 


I{^2;y\^i) = E 


Exi X2 ^^y|xi,X2 (y|x'i, XsKi (^l)Px2 (X 2 ) 

(27) ; 

_ Py\xi,x'^{y\^l,^2) _ 

^Sxi,x^ 7^y| Xi ,X2 (y|xi,x'2)p 

XI (Xi)px^(x'2) 

( 28 ) ■ 

Clearly, we know that /(x 2 ; y) is the mutual in¬ 
formation when user 2 is decoded first considering 
user 1 signal as noise. Therefore, we can write it as 
follows, 


Proof 6: The proof of the corollary follows from 
the chain rule of mutual information and the gradi¬ 
ent of the mutual information derived for the sum 
rate and non-conditional rates. The second term in 
(1^ and (lJ7l) are given in Theorem [51 The second 
term of (l35l) and (l36l) are provided with proof in 

m. 

IV. Mutiuser MMSE and Mutual 
INFORMATION CLOSED FORMS 


/(x 2 ;y) = E 


Py|x 2 (y|x 2 ) 


(29) 


PH|x2(y|x2) = 

g-(y-v 2 snrH 2 P 2 X 2 )t HiPi+I)-l(y-v'inrH 2 P 2 X 2 ) 

Pyiy) = '^PyKiy\^ 2 )Px'^i^' 2 ) (31) 

Based on such definition, we conclude the following 
theorem which provides a new fundamental relation 
between the gradient of the mutual information with 
respect to the precoder of the other user given that 
the other user will be secondly decoded. 

Theorem 5: The gradient of the mutual informa¬ 
tion with respect to the precoder, for a scaled user 
power when the other user input is considered as 
noise is as follows, 

Vp,/(x2;y) = H2P2E2PtHtHitHiPi(ptHtHiPi +I)-i 

( 32 ) 

Vp^/Cxi; y) = HiPiEiPt Ht H2tH2P2(PiHt H2P2 + 

( 33 ) 

Proof 5: The proof follows similar steps of the 
proof of Theorem jH 

When each user is transmitting over a single chan¬ 
nel, the new relation in Theorem [5] will be more 
clearly understood in terms of the effect of the in¬ 
terference plus noise power scaling on the gradient, 
see nni . In other words. Theorem [5] is of particular 
relevance to understand how the rate changes and 
can be adapted based on the changes of the channel 
and power of the interference. 

Corollary 1: The gradient of the conditional mu¬ 
tual information will be as follows, 

Vpi/(xi;y|x2) = Vpi/(xi,X2;y) - Vpi/(x2;y) (34) 

Vp2/(xi;y|x2) = Vp2/(xi,X2;y) - Vp2/(x2;y) (35) 

Vpi/(x2;y|xi) = Vpi/(xi,X2;y) - Vpi/(xi;y) (36) 

Vp2/(x2;y|xi) = Vp2/(xi,X2;y) - Vp2/(xi;y) (37) 


The only known explicit closed forms of the 
MMSE and the mutual information are for BPSK 
inputs, O, and QPSK IITSlI for the SISO channel. 
In the SISO case, the relation between the mutual 
information and the MMSE for the signle user setup 
allows for the derivation of this form. However, for a 
MAC channel, as an example of multiuser channels, 
we will first consider a unit power, unit channel 
gains for simplicity. We will then capitalize on the 
new unveiled multiuser I-MMSE generalization of 
the relation between the mutual information and the 
two user MMSE with the covariance. Therefore, we 
derive new explicit closed form expressions of the 
MMSE and the mutual information for each user in 
the two user Gaussian MAC driven by BPSK. To 
derive a closed form expression of the conditional 
and non-conditional mutual information for each 
user under the MAC, we capitalize again on the 
chain rule of the mutual information stated in (l23l) . 
The first user which will be decoded first given that 
the other user is noise, therefore the MMSE and the 
mutual information of user 1 will be respectively, 
given by the following theorems. 

Theorem 6: The non-conditional mmse'fsnr) of 
the user 1 decoded first and scaled with the other 
user as noise is given by: 


mmse'T(snr) = 1 -= 

4yT 




e ^ dy 
( 38 ) 


Proof 7: See Appendix B, part I. 

Theorem 7: The non-conditional mutual informa¬ 
tion I[{snr) of the user 1 decoded first and scaled 
with the other user noise is given by: 


, , f Jsnr \ 

log cosh ( ^ y I e ^ dy 

( 39 ) 

Proof 8: See Appendix B, part II. 

However, the conditional MMSE and conditional 
mutual information of user 2 that will be decoded 
next under the MAC given that the first user is 
decoded first are given on the following theorems. 


4 4 y 7 r Jy^R 
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Theorem 8: The conditional mmse 2 {snr) of user 
2 decoded second given that user 1 in the MAC 
channel is decoded first with BPSK inputs is given 
by: 


mmse2{snr) = 1 — 






tank (-v/snry) e' 


-(y-y/TTTF)^ 


dy 


(40) 


Proof 9: The proof follows similar steps as in 
Appendix B, Part I, and follows the formula in ||2l|. 

Theorem 9: The conditional mutual information 
l 2 {snr) of user 2 decoded second given that user 
1 in the MAC channel is decoded first with BPSK 
inputs is given by: 


fisnr) 


1 




log cosh (-v/snry) 



Proof 10: The proof follows similar steps as in 
Appendix B, Part II, and follows the formula in |[2l|. 
Notice that if both users are time sharing or decoded 
jointly, at such point, a maximum sum rate is acheiv- 
able, therefore, each user’s rate will follow the one 
in Theorem 0 Such case is similar to two parallel 
channels for each user, therefore, the sum rate is 
the sum of each individual rate. However, from 
Theorems [7] to [8l its straigntforward to conclude the 
following corollaries that defines the total MMSE 
and mutual information of a two user MAC driven 
by BPSK inputs. 

Corollary 2: The total mmse{snr) of two users 
MAC channel with BPSK inputs is given by: 


mmse{snr) 

Where, 


mmse'i (snr)+mmse 2 {snr) 


—tjj{snr) 

(42) 


mmse'fsnr) = mmsei{snr) (43) 


and. 


inputs or the integral of the MMSE of both users. 
However, a general form of the joint mutual infor¬ 
mation that clarifies the new fundamental relation 
with unequal covariances is when f^snr) 0. This 
is given by the following corollary. 

Corollary 3: The total I{snr) of two users MAC 
channel with BPSK inputs is given by: 

I [snr) = I[{snr) + l 2 {snr) (45) 

I[{snr) corresponds to the mutual information of 
user 1 given the other user is considered as noise and 
/^(snr) is the mutual information of user 2 given 
that user 1 is decoded first. 

Proof 12: See Appendix B, part II. 

Eigure 1 illustrates the mutual information per user 
in a MAC and the sum rates under equivalent 
powers and compared to the case of two users over 
SISO parallel channels. Its quite clear now, why 
the mutual information for a MAC Gaussian chan¬ 
nel approaches 1.5 bits/sec/Hz when both inputs 
have similar power, incuring 0.5 bits/sec/Hz loss, 
as previously explained in ifTOll . and why it doesnt 
approach the one of parallel Gaussian channels 
unless unblanced power allocation takes place -the 
so called mercury/waterfilling, which approaches 
2 bits/sec/Hz for BPSK at high SNRs. Moreover, 
when successive decoding takes place, an ufair rate 
allocation takes place, were the user decoded first 
will pay the price from his achievable rates. This 
can be also well explained in terms of the MMSE, 
where the MMSE of the user decoded first has a 
scaled SNR, with a scaling factor less than one. 
This will let this scaled MMSE not to decay to zero, 
however, it saturates at high SNR to a point above 
the zero, at 0.5 for this example. 


mmse' 2 {snr) = mmse 2 {snr) + 'f{snr) (44) 

Proof 11: See Appendix B, part II. 

We shall now capitalize on the unveiled connection 
between the mutual information and the MMSE 
plus the covariance or cross correlation of the input 
estimates. In the specfic case in (ItTI) . and when 
both user inputs are decoded jointly, such covariance 
terms can be easily shown to be equal, over all 
permutations of the inputs. When the inputs are 
orthogonal or time-sharing, the covariance terms 
vanishes, i.e. 'tp{snr) = 0. In turn, the joint mutual 
information is just the sum of the rates of both 


V. Multiuser I-MMSE in the Eow-SNR 
Regime 

We now consider the two-user MAC Gaussian 
channel with arbitrary input distributions in the 
regime of low-snr. Consider a zero-mean uncorre¬ 
lated complex inputs, with E[xix^l] = I, E[x 2 X 2 ] = 
I, E[xiXi^] = 0, and E[x 2 X 2 ^] = 0. We consider 
the low-snr expansion to the MMSE of equation 
m- Note that it can be easily deduced that the 
Taylor expansion of the non—linear MMSE in (fTOl) 
will lead to the first order Taylor expansion of the 
linear MMSE for the Gaussian inputs setup. Thus, 
the low-snr expansion of the MMSE matrix can be 
expressed as: 
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Figure 1. The two-user per MAC rates and sum rates with BPSK 
inputs. 


E = I-(HiPi)^HiPi.snr 

- (H 2 P 2 )^H 2 P 2 .snr -F 0 {snr^), ( 46 ) 

with E = El + E 2 . Consequently, 

mmse{snr) = Tr |HiPiEi(HiPi)^| 

+ rr{H 2 P 2 E 2 (H 2 P 2 )^} ( 47 ) 

= rr{HiPi(HiPi)t} 

+ rr{H 2 P 2 (H 2 P 2 )^} 

-rr{(HiPi(HiPi)^) 2 }.snr 

- Tr {(H2P2(H2P2)^)^} .snr + O(snr^). ( 48 ) 

Note that due to our new result of Theorem [B we 
eannot apply immediately the fundamental relation¬ 
ship between mutual information and MMSE in |[3]|, 
lO. Therefore, the low-snr expansion of '(^{snr), the 
eovarinee or the eross eorrelation between the inputs 
estimates can be expressed as: 

r/;(snr) = -Tr {HiPi(HiPi)^H2P2(H2P2)^} snr 

- Tr {h2P2(H2P2)^HiPi(HiPi)^} .snr ( 49 ) 

Applying our new result, the low-snr Taylor ex¬ 
pansion of the mutual information is given in the 
following theorem. 

Theorem 10: The low-snr Taylor expansion of the 
mutual information of the two user MAC is given 
by: 


/(snr) = Tr {HiPi(HiPi)^} .snr-F 

Tr{H2P2(H2P2)^}.snr 
-Tr{(HiPi(HiPi)^)^}.snr2 
-Tr{(H2P2(H2P2)^)^}.snr2 
- Tr {HiPi(HiPi)^H2P2(H2P2)^} snr"^ 

- Tr {h2P2(H2P2)^HiPi(HiPi)^ } .snr^ -F 0 {snr^) 

(50) 

Proof 13: See Appendix C 
The wideband slope — which indicates how fast the 
capacity is achieved in terms of required bandwidth 

— is inversely proportional to the second order terms 
of the mutual information in the low-snr Taylor 
expansion (l50l) . Therefore, this term is a key low- 
power performance measure since the bandwidth 
required to sustain a given rate with a given low 
power, i.e., minimal energy per bit, is inversely 
proportional to this term tIM . Further, its clear that 
the 5th and 6th term in (l50l) are due to ip{snr), to 
which they play a fundamental role in the rate losses 
encountered at the low-snr regime. 

VI. Conclusions 

We generlize the fundamental relation between 
the mutual information and the MMSE, the I- 
MMSE identity in all its current forms, to a new 
fundamental relation, the Multiuser I-MMSE, which 
applies to multiuser vector and scalar channel se¬ 
tups. Further, we proved our generlization by de¬ 
riving the relation for the joint mutual information, 
conditional and non-conditional mutual information. 
We capitalize on our unveiled generalized relation to 
find explicit closed forms of the mutual information 
and the MMSE of multiuser channels driven by 
BPSK inputs, and to derive the mutiuser I-MMSE 
at the regime of low SNR. Besides, the impact 
of the result is many fold. We mainly quantify 
the data rate losses due to the interference, which 
constitutes the gap from the cut-set upper bound. 
This gap / ip{snr)snr in MAC channels as a special 
case of interference channels is characterized with 
respect to the users channels, precoding (power 
allocation) and the decoding or estimation process 
of inputs for the first time. In turn, we allow for fu¬ 
ture characterization of the capacity of interference 
channels. Additionaly, this result allows for future 
investigation and characterization of the network I- 
MMSE. On the other hand, the new fundamental 










relation will have high impaet on future designs of 
transmission sehemes that are interference-aware, 
due to the awareness of the covariance (gap) in¬ 
troduced due to the interference. It will also have 
impact on statistical signal processing applications 
that are based on classification of mixtures of data 
in a measurement system. 

VII. Appendix A: Proof of Theorem □ 

The conditional probability density for the two- 
user MAC can be written as follows: 


Py|xi.X 2 (y|xl,X 2 ) = —e 


— 2/— \/snr H1P1X1 — A/inr H 2 P 2 X 2 


Thus, the corresponding mutual information is: 

Py\xi,X2 (y|xi,X 2 


/(xi,X 2 ;y) = E 


log 


Py{y) 


dsnr 


dsnr 


Pyiy) 


dsnr 


dsnr 


d 


-PH|2:i,a:2(y|xi.X2);T- (y “ v/inrHiPixi - y3nrH2P2X2)^ 


dsnr 

(y — y/ snrY 


^ ((HiPiXi)t + (H 2 P 2 X 2 )t) 

'snr \ / 


(y — y snrHiPixi — yj snrH2P2X2) X 


(^(HlPlXl)t + (H 2 P 2 X 2 )'I‘) VyPy| 2 ,j^a: 2 (y|xi,X 2 ) 

( 61 ) 


Therefore, we have: 

[VsnrPH| 2 :i,a: 2 (yl^l>X 2 )] = 


((HiPlXl)t + (H 2 P 2 X 2 )t) VyPy| 2 ,j,a: 2 (y|xi,X 2 ) 
'snr \ j 


( 62 ) 


Substitute (1^ into (1571) . we get: 

yjsnr J 


dsnr 


,i,,2[((HiPixi)t + (H2P2X2)' 


(51) 


( 52 ) 


/(xi,X2;y) = -Urlog^TTe) -E[log {py{y))] ( 53 ) 
/(xi,X2;y) = -nrlog{ne)- Py{y)log {py{y)) dy 

( 54 ) 

Then, the derivative of the mutual information with 
respect to the SNR is as follows: 

(i/(xi,X2;y) d 


VyPy|xi,x 2 (y|xi.X 2 )]dy ( 63 ) 

= f {I + log {py{y))) ((HiPiXi)t + (H2P2X2)t) 

y/snr J ^ / 

VyPa|a:i,a: 2 (y|xi>X 2 )<^y)] (^4) 

Using integration by parts applied to the real and 
imaginary parts of y we have: 


J {1 +log (py{y))) 


dPa|a:i, 2 , 2 (yl^l>^ 2 ) 


dt 


dt = 


J d + log{pyiy)))py\^^^^^ (y|xi,X2)| 
1 dPviy) 


/ 


Pi;|a:i,a;2(y|xi>X2)dt (65) 


Py{y)log {Py{y)) dy 

( 55 ) 

Pp(y)^T + ^09 (Pp(y))l 


The first term in 
Therefore, 

(l/(xi,X2;y) _ 1 


( 56 ) 


= -j {l + log{py{y)))^^^dy (57) 

Where the probability density function of the re¬ 
ceived vector y is given by: 

Py^y)= Pp|xi,x 2 (y|xi,X 2 )p X\,X2 (xi,X2) (58) 

Xi,X 2 

= ^xi,X2[Py\xi,X2iy\^l,^2)] (59) 

The derivative of the conditional output with respect 
to the SNR can be written as: 

9P;,|xi,X2(y|xi>X2) _ 


dsnr 


d7(xi,X2;y) 1 


-oo Pyiy) dt 

goes to zero as ||y|| —)■ cx). 

^ 1,^2 [-/(((HiPiXi)t + (H 2 P 2 X 2 )t)x 


P^|3;i,rc2(yl^l’^2) 

Pyiy) 


VyP!,(y)dy)] (66) 


I J ^yPy(y)> 


dsnr 

E:.1..2[((HlPiXl)t + (H2P2X2)t) ^^'"^’""^f^^^’''''V y (67) 

Pyiy) 


dJ(xi,X2;y) 

dsnr 


However, 


7^ [ VyPy(y)E2,i,:c2[(HlPl)tE^ I [xi|y]t 
/snr J 

+ {H2P2)tE,^|j,[x2|y]t]dy (68) 


[iPixi — \/3nrH2P2X2) (60) 


^yPyiy) = VyEa;^,j ;2 [Py\xi,X 2 iy\^-i->^^)] 

= T^xi,X 2 [VyPy|^j,a; 2 (y|xi,X 2 )] 

= -Exi,X2 [Pi/|a:i.2;2(yl^l>^2) (y - \/inrHlPlXl - V^H2P2X2)] 
= -Ej:j,a:2 [Pyiy) (y - Vsnr-HiPiXi - v^infH2P2X2) |y] 

= -Pyiy) (y - C^HiPiE 3 .j|j,[xi|y] - VSirH 2 P 2 E^ 2 |y[x 2 |y]) 


(69) 


Substitute (1691) into (1681) we get: 






























d/(xi,X2,y) ^ _l f Py{y){y - v/SifHiPiE^ I [xi|y] 
dsnr y/snr J ' 

- v^ 5 nrH 2 P 2 Ej; 2 |y[x 2 |y])x 

((HiPi)tE,^|„ [xi|y]t + (H2P2)tE,^|^ [x2|y]t) dy 

dI(xi,X2;y) ^ ^^]g^[y^t](HiPi)t + ^^Ey[yxt](H2P2)t 
dsnr y/snr vsnr 

-Ey[HiPiE2,j|y[xi|y]E^j|j,[xi|y]t](HiPi)t 

- Ey [HiPiE^j Iy [xi |y]E^ 2 Ih [^ 2 lyl^1 (H 2 P 2 )''' 

- Ey [H2P2E2,2|y[x2 lyJE^.^|y [x2|y]t] (H2P2)t 

-Ey[H2P2E3.2|y[x2|y]E^^|y[xi|y]t](HiPi)t 

Therefore, 

dJ(xi,X2;y) ^ jjiPiE^i [xix+](HiPi)t 
dsnr 

-HiPiEy[E,^|y[xi|y]E,^|^[xi|y]t(HiPi)t 

+ H2P2E:,2[x2xt](H2P2)t 

- H 2 P 2 Ey[E 3 . 2 |y[x 2 |y]E 3 . 2 |y[x 2 |y]hH 2 P 2 )''' 
-HiPiEy[E,^|y[xi|y]E„^|y[x2|y]t](H2P2)t 

- H2P2Ey [E^2 |y [x2 |y]E^^ |y [xi |y]t](HiPi)t 


dl {xi,X 2 -,y) =HiPiEi(HiPi)t + H 2 P 2 E 2 (H 2 P 2 )''‘ 
dsnr 

- HiPiEy[E 2 ,j|y[xi|y]E^ 2 |y[x 2 |y]t](H 2 P 2 )t 

- H 2 P 2 Ey[E^ 2 |y[x 2 |y]E^^|y[xi|y]t](HiPi)t 

Therefore, the derivative of the mutual information 
with respect to the SNR and the per users mmse 
and input estimates (or covariances) is as follows: 


dJ(xi,X 2 ;y) 

dsnr 


= mmse\(snr) -\- mmse2{snr) 

- Tr |HiPiEy[xixt](H 2 P 2 )+} 

-Tr |H 2 P 2 Ey[S 2 Sl](HiPi)t} 


Therefore, we can write the derivative of the 
derivative of the mutual information with respect 
to the snr as follows: 

dlisnr) 

— - -= mmse{snr) + 'ipysnr) (70) 

dsnr 

Therefore, Theorem [U has been proved as a gener¬ 
alization of the one hy Guo, Shamai, Verdu in lO 
to the multiuser case. 


VIII. Appendix B: Multiuser MMSE(snr) 
AND I(SNR) FOR BPSK INPUTS 

A. Part I 

Consider the simplified case for a channel model 
given by: 

y = y/ snr xi -t- y/ snr X 2 + n, (71) 
The total MMSE is given as: 

nimse{snr) = mmsei{snr) + mmse 2 {snr) (72) 


Therefore, we can write the non-linear MMSE ma¬ 
trix for each user respectively as: 

El =E[(xi -E[xi|y])(xi - E[xi|y])'''] 

= E[xixl] - E[E[xi |y]E[xi |y]], ( 73 ) 


Ea =E[(x 2 -E[x2|y])(x2 - E[x2|y])'t'] 

= E[x 2 x^] - E[E[x 2 |y]E[x 2 |y]], (74) 


with. 


E[xi|y] = 


Sxi ,X2 ^lPy\xi ,X2 (y I Xl, X2 )Pxi (Xl (X2 ) 


Pyiy) 


Exi ,X2 ^lPy\xi ,X2 (y I Xl, X2 )Pxi (Xl (X2) 


(75) 


Exi,x^ Py|xi,x^(y|x'i, x' 2 )py,;(x;)py,/ (x' 2 ) 


E[x2|y] = 


(76) 

^X1,X2 ^^Py\xi,X2 (y |xi, X2)Pi:i (xi)p^2 (^2) 


Pyiy) 


(77) 


_ Exi ,X2 ,r .2 (y I Xl , X2 )p2:, )Px 2 (^2 ) 

Ex', ,x(, Py|x'„x^(y|x'i, X'2)p^/ (X'i)py,/^(X'2) 

(78) 

Its of particlar importance to notice that the condi¬ 
tioning over X 2 in Py|xi,x2(y|xi,X2) inside E[xi|y] 
does not correspond to knowledge of the message 
X 2 , but to considering it as noise in this setup. There¬ 
fore, we yet account for px^{-K 2 ). On the other hand, 
its worth to note also that the conditioning over xi 
in Py\xi,x 2 (y |xi, X2) inside E[x2 |y] is just for clarity. 
However, if xi is decoded first, then when decoding 
(estimating) X 2 next, we remove Xi. Therefore, 
E[x 2 |y] is in fact equal to E[x 2 |y — xi], this will 
make Py|xi.x 2 (y|xi, X 2 ) equals to Pj;|a: 2 (y|x 2 ) and 
Pxii^i) will be absent accordingly from the equa¬ 
tion. 

Eor the two user MAC driven by BPSK inputs, 
the values of xi = {1, —1} and X 2 = {1, —1}. The 
non-linear estimates in (l76l) and (1781) consider that 
both user inputs are decoded jointly. However, we 
are interested in successive decoding of the users 
inputs. 

Therefore, the non-linear estimate of user 1 de¬ 
coded first in the MAC considering user 2 as noise 
that scales the SNR of user 1, and so we can write 
it with respect to all possible permutations of the 
possible inputs of user 1 as follows: 


Exi ^^Py\^l,X2 > X2)Pl.i (Xl)px2 (X2) 

Ex', Py|x', (y|xi, X2)Pi,', (x'i)pi,2 (X2) 


(79) 
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]E[xi|y] = 

However, 

E |^E{xi|y)E{xi|y)tj 


4 — e J 


-(y-V^)2 


(y+V^) 


(80) 


1 into (17^ the scaled MMSE of user 1 over a MAC 
channel with BPSK inputs is given by: 


. 1 /" (\/snr \ 

mmsei{snr) = 1-— / tank I -y ) 

4V^ Jy£R V 2 J 


e 4 cZy 


/( 


Exi XiPy|:ci.a:2 (V |xi, X2 )p3:i {xi)px2 (X2) 


Py{y) 


Pyiy)dy 

(81) 


E[E(xi|y)E(xi|y)t] = 

-! 

Stt J 


-(y-Venr)^ ~(y + VBnr)^ 

e 3 — e J , -(y- 

(e 


-(y-^ 


F)4 


(y+yin7)4 


— e 5 


Therefore, Theorem [ 6 ] has been proved. 


B. Part II 

Due to the relation between the MMSE and the 
mutual information for SISO channels, the mutual 
information for user 1 decoded hrst and with a 2a^ 


Digging into the depth of the right hand side of 
equation ( 1821 ) . we have: 


)dy scaled snr is given by: 

(82) = flE _ J_ / log cosh (^y) 


-(y-v 


3! 


(y — \/snrY = — 2yjsnry + snr 


(83) 


Thus, 


dy 
(91) 

Where cr^ = 2 is the sum of the noise and inter¬ 
ference power variance. Eollowing similar steps to 
the ones above, user 2 will be decoded next given 
(y -I- ^snrY = y^ + 2 \/snry -f snr (84) (conditioned) on the knowledge of user 1 who is 

decoded first, therefore, the non-linear estimate of 
user 2 message removing user 1 message is: 

Ex2 ^2Py\x2{y\^2)Px2{^2) 


-{y—ysnr)"^ 


— e 


— (y+v/anr)^ 


e 2 


-y _ 


— (y— -\/snr)2 — {y-\-ysnr)^ 

e 4 -(- e 4 


E[x 2 |y] = 


e 2 y + e 


= tank 


f y/snr 


ny) 


(85) 

( 86 ) 


Ex'Py|x'(y|x2)Px2(x2) 


-(y-v 


E[x 2 |y] = — 


— (y+\/snr)-^ 

e 2 


(y — y'snr)^ — (y+y'snr)^ 

e 2 -1- e 2 


(92) 

(93) 


It follows that: 

E [E(xi|y)E(xi|y)t 


Eollowing the same steps as before, and capitalizing 
on the new unvelied relation, the MMSE of user j. 
will be given by. 


— f tank ( - 
Stt J V 


-ny) 


-(y- 


21 -(y + VI^)-^ 

— e 4 


dy 

(87) 


!' _ _ ^ — (y — y'snr)^ 

mmse 2 {snr) = 1 - -= / tank [\/snry) e 2 dy (94) 

Jy^R 


Therefore, 

E |^E{xi|y)E(xi|y)tj 


and the mutual information for user 2 decoded next 
is given by: 


/ tank 1 

\/snr ^ \ 

Jy£C \ 

2 J 


-(y- 


lUsnr) = snr - f 


ySR 


_ ^ — {y — y/snr)'^ 

log cosh (v/sriry) e 2 dy (95) 






However, it is known that: 

tanh{—x) = —tanh{x) (89) 

an d th e expectation remains the sa me if y Af 
{ysnr, 1 ) replaced by y ~ A" 
symmetry, therefore, we have: 

e[e{ Xl |yWxi|y)t] 


Notice that due to the new fundamental relation 
between mmse(snr) and the mutual information, 
'dy ( 88 ) observe the effect of the covariance terms 

il^^snr), given as; 


iP(snr) = -Ey[E„^|y[xi|y]E„2|y[^2|y]^] 

-E!/[Ex2|y[x2|y]E„^|y[xi|y]t] (96) 

Eollowing similar steps to the ones before, we 
ly rVplaEd by'y AA* 7 -"^^,“i)‘; due'to see that the covariance term will have a negative 

value which explains the loss in the mutual infor¬ 
mation in I[{snr), and correspondingly to I (snr). 
Therefore, the covariances of such setup are given 
as: 


e 4 dy (90) 


Therefore, due to marginalization of the complex 
domain into the real domain, substituting E[xix^] = 


Ey[E^i|y[xi|y]E^,|y[x2|y]^] 
1 




tank 


veR 


-(y-V^)2 


y e 


-dy (97) 

























































11 


and, 


Ej,[E^^lj^[x2|y]E^^|y[xi|y]'l’ 

1 


4\/^, 


VGR 


tank {\fsnry) e 




dy (98) 


Both terms are not equal, which can be explained 
by Py{y) that is different in the integration based on 
who is decoded first. 

The new fundamental relation between the mutual 
information and the MMSE plus covariance states 
that. 


dl{snr) 

dsnr 


= mmsei{snr) + mmse 2 +'ip{snr) (99) 


However, we derive the mutual information I[{snr) 
based on the following: 


dl[{snr) 

dsnr 


= mmse'i{snr) 


( 100 ) 


and the mutual information I' 2 {snr) was derived 
based on the following: 


dH^isnr) 


dsnr 

It follows from (l99l) that, 

dl2{snr 
dsnr 

Therefore, 

dl2{snr 


= mmse2{snr) 


= mmse2{snr) + ifj^snr) 


dsnr 

This means that: 


dli^^snr) 

' = 'ijj(snr) 


dsnr 


( 101 ) 


( 102 ) 


(103) 


mmse{snr) = mmsei{snr) + mmse 2 {snr) 

= mmse\{snr) + mmse 2 {snr) 

= mmse'i(snr) + mmse' 2 {snr) — 'il){snr) (104) 

and, 

I{snr) = I[{snr) + l 2 {snr) (105) 

I{snr) = I[{snr) + l 2 {snr) — J 'tp{snr)dsnr 

(106) 

Moreover, due to the reasons discussed earlier, or 
when both inputs are decoded jointly, those co- 
variance terms in 'tp{snr) might collapse to zero 
and so the mutual information will be the sum of 
the integral of both users MMSEs mmsei{snr) + 
mmse 2 {snr). Therefore, Theorem [7] has been 
proved with its following corrollaries. 


IX. Appendix C: Proof of Theorem [TO] 

First we will find the low-snr expansion of the 
MMSE matrix Ei -|- E 2 for user 1 and user 2 
in (l46l) as snr —)■ 0. Therefore, we will first derive 
the low-snr expansion of the conditional probability 
exponent given as: 

|y — -x/sm-HiPiXi — y/ snr'H.2P 2^2]^ 

= (y — -v/sw-HiPiXi — y/snr'H.2P2^2y x 
(y — i/snrHiPiXi — y/ snr'H.2P 2^2) 

= |y|^ - ^/snr (^y'l'HiPiXi -f (y^HiPiXi)^^ 
-hsnr(HiPiXi)tHiPiXi 
+ snr (HiPiXi)^H 2 P 2 X 2 
— snr (^y^H 2 P 2 X 2 -I- (y^H 2 P 2 X 2 )^) 

+ snr (H2P 2X2)^112^ 2X2 -|- snr (H 2 P 2 X 2 )^HiPiXi 
= |yp - 2 y/s^n (y^'HiPiXi) 

+ snr |HiPixip -I- snr(HiPixi)^H 2 P 2 X 2 
— 2 yJsnrTZ (y^H2P2X2) 

-I- snr|H 2 P 2 X 2 |^ 

-|- snr (H 2 P 2 X 2 )^HiPiXi 


Hence, 

Py\^i,^2 (y|xi,X2) = 


exp (—|y — -^/snrHiPiXi — VsnrH2P2X2i 


.g(-|ylO^ 


^(2 v^inr■R.{ym 1P1X1) + 2 v'snr7^(y t H2 P2X2 )) 

2 

(snr iHjPjXj p+snr (H2P2^2 Hi P1X1+snr (HiP^xi) tH2 P2^2 ) 

e 

However, due to: 


/■ / - , \ exp (ay/snr) 

exp (a\/snr — bsnr) = - - -— (107) 

exp [bsnr) 

The Taylor expansion of the numerator and the 
denominator of (11071) as snr —)■ 0, is given as, 

. . - , 1 -|- aJsnr -|- O (snr) 

exp (av snr — bsnr = -—-r- 

^ 1 + 0 {snr) 

= 1 -|- ay/snr + O {snr) 

Therefore, the low-snr expansion of the conditional 
probability distribution of the Gaussian noise is 
defined as: 

Py\xi,X2 (y|xi,X2) = 

^exp(-|y|2)x 

(1 -I- 2 y/snrTZ (y'^HiPiXi) -|- 2 +snrTZ (y'^H2P2X2) 

+ 0 {snr)) ( 108 ) 
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A. Derivation of the Multiuser MMSE at the Low 
SNR 

The first term of the MMSE matrix of user 1 Ei 
is E[xiXi^] = I. However, to find the seeond term 
of El, E[xi|y] is defined as: 


E[xi|y] = 


Exi,X 2 XiPy|^i,^2(y|Xi,X2Ki(Xi)p^2(x2) 


Pyij) 


(109) 


We need to substitute (11081) into (11091) as follows: 


V.y\E^^\y[xi\y\ (E^^|y[a;i|y])^] 

= / ^exp(-|y|^)p^i (xi)p^2 (xa) 

Jy^C^r TT'*'- 

2ysnf7^ (y^HiPiXi) + O {sr 


Ex' x' 1 + E 2yinf7^ (ytHiPixl) + O [snr] 

^ ' i=l 
2 


( xi ( 1 + 2fsnrTZ ^y^HiPiX,^ + O (snr) j )^dy 


exp(-|y|^)x 

{ ^ (1 + 2i/srif7?.(ytHiPiXi) 

XI ,X2 

+ 2^ snr'fl{y^ 'H 2 P 2 ^ 2 ) 

+ C>(snr))pxi (xi )px2 (xa)) 

= exp(-|yp)x 
(l + \/snf ^ (y+HiPixi 

Ti ,X2 

+ (ytHiPix'i)Tpa;i(xi)px2 (X2) 

+ i/srif ^ (ytHaPax'a + (ytHaPax2)T + 

cci,a:2 

= exp(-|yp)(l+ 

\/sr?r(y^Hi PlExi [xi] +Exi[xit](ytHiPi)t) 
v/ 3 nf(ytHaPaEx 2 [xa] + Exj [xafi(y'''HaPa)T + C>(snr)) 

= exp (-|yp)(l + 0(snr)) 


Therefore, 


Ey[Ex,|j;[xi|y] (Exi|j,[xi|y])'''] = 

JyeC”-r TT"'- 

snr (HiPi)^ y + O {snr)j (^^/snr (HiPi)^ y + O {snrfj 
1 + 0 (snr) 


Recall that E[xi] = = 0^ 

E^-^^^-JxiXi^] = 0, E2 .j,^JxiX2^] = 0, 

Exi,x 2 [x 2 Xi^] = 0, E^^j^^Jxixi] = I, and 

Exi,x2[x2X^] = I. 

Therefore, the numerator of (11091) is given by. 


( 110 ) 


/ 

JyGC-^ 


-jj-nr 


exp(-|y|^) 


snr (HiPi)tyytHiPi + O (^snr^') 


1 + 0 (snr) 


dy 


( 111 ) 


exp(-|yP)x 

(E Xi(l + 2v/snf7?.(ytHiPixi))pxi(xi)px2(x2) 

>^2 

+ 2Vsnr’7^(ytH2Pax2) + C>(snr))pxi (xi)px2 (x2) 

= exp(-|y| 2 )x 

Exi,a:2[xi] + \/snfExi,x2[xi](y''‘HiPiXi + (ytHiPiXi)t) 
+ VsnrExi,x 2 [xi](y'^H 2 P 2 X 2 + {y^H2P2X2)T + O(snr) 

= exp(-|yp)x 
,X2 [^ 1 ] 

+ v/srifExi,x2[xi]((xi^Pi'^Hi^y*)^ + (HiPiXi)ty) 

+ i/sriTExi,x2 [xi]((x2^P2^H2 V)"^ + (H2P2X2)V) 

+ 0{snr) 

= exp(-|yp)x 

v/snr(Exi,x 2 [xix^](HiPi) V) + O(snr) 

= exp (-|yp)yCTiT(HiPi)ty + O(snr) 


It follows that: 

Ey[Exi|y[xi|y] (Ex,|y[xi|y])^] 

= (HiPi)^HiPisnr + (!)(snr2) (112) 

Consequently, the low-snr expansion of the MMSE 
matrix of user 1 Ei is given as follows: 

El = I-(HP)tHiPi.snr+ C>(snr2) (113) 

Similarly, the low-snr expansion of the MMSE 
matrix of user 2 E 2 is given as follows: 


dy 


and the denominator of (11091) is given by. 


E 2 = I - (H 2 P 2 )^’H 2 P 2 .snr + 0{snr^) (114) 
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Therefore, we ean express the low-snr expansion of 
the total MMSE in terms of the snr as follows: 

MMSE {snr) = Tr {HiPiEi (HiPi)^} 

+ Tr {H2P2E2 (H2P2)^} 

= Tr {HiPi (i - (HiPi)^ HiPisnr + O (snr^)) (HiP 
+Tr {H2P2 (l - (H2P2)^ H2P2snr + O (snr^)) (H2P2 
= Tr {HiPi (HiPi)^} - Tr I (HiPi (HiPi)^)'| snr 

+ Tr {H2P2 (H2P2)^} - Tr I (H2P2 (H2P2)^)'| snr 

+ O {snr'^) 


B. Derivation of the Multiuser Mutual Information 
at the Low SNR 

We shall now eapitalize on the unveiled gener¬ 
alization of the fundamental relation between the 
mutual information and the MMSE plus covariance. 
Therefore, using similar steps to derive the low-snr 
expansion of the covariance given by. 


f{snr) = 

- Tr { HiPiEy [xi |y]E,,,|y [x2 |y]t] (H2P2)^ } 

- Tr {H2P2Ey[E,,,|y[x2|y]E,,,|y[xi|y]t](HiPi)t} , 

Substituting the low-snr expansion of Ea,^|y[xi|y] 
and E 3 . 2 |j^[x 2 |y] into the covariance, the low-snr 
expansion of covariance 'ijj{snr) as snr —)■ 0 is given 
by: 


f!{snr) = 

- Tr |HiPi (HiPi)^ H 2 P 2 (H 2 P 2 )^} snr 
- Tr IH 2 P 2 (H 2 P 2 )^ HiPi (HiPi)^} snr 

Therefore, capitalizing on the fundamental relation 
which states that, 

dl(snr) . , ,, , 

— - -= mmse[snr) -f ip[snr) (115) 

dsnr 

The mutliuser mutual information at th e low snr 
regime is the integral of both sides of (II151) . and 


so its given by: 

/(snr) =Tr{HiPi(HiPi)^|snr 

-/Tr{H2P2(H2P2)^}snr 

-Tr{(HiPi(HiPi)^)"}snr2 

1 )^} -Tr{(H2P2(H2P2)^)^}snr2 + 

)^} + Tr {HiPi(HiPi)^H2P2(H2P2)^} W 

- Tr {H2P2(H2P2 )^HiPi(HiPi)^| snr^ C>(snr 3 ) 

( 116 ) 

Therefore, Theorem [10] has been proved. 
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